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"THE MANUFACTURE AND EXPRESSION OF GENES FOR THAUMATIN" 

BACKGROUND 

5 The present invention relates generally to the 

manipulation of genetic materials and more particularly 
to the manufacture of specific DNA sequences useful in 
recombinant procedures to secure the production of a 
polypeptide having one or more of the biochemical or 

10 immunological properties of thaumatin. 

Thaumatin is an extremely sweet-tasting pro- 
tein produced in the arils of the fruit of the African 
shrub Thaumatococcus danielli Benth. The fruit tradi- 
tionally has been used in West Africa as a sweetener of 

15 palm wine, corn, bread, and sour fruit • Thaumatin, 
which is about 1600 times sweeter than sucrose on a 
weight basis, is produced in at least five forms: 
thaumatin I, II, a, b and c. These proteins, named in 
their order of elution from an ion exchange column 

20 [Higgenbotham, et alw in Sensory Properties of Foods 
(Birch, et al« , eds.) , London: Applied Sciences, pp. 
129-149 (1977)], have molecular weights of approximately 
22 kilodaltons, Thaumatin I and II are nearly identical 
proteins, each consisting of a single unmodified poly- 

25 peptide chain, 207 amino acid residues in length, dif- 
fering in only five amino acids. 
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Thauinatin I and II are non-toxic proteins, are 
low-calorie and non-cariogenic, and elicit profound taste 
responses suggesting a stable interaction between these 
proteins and human taste buds. Therefore, thaumatin has 
5 potential for use as a sugar substitute, food additive, 
a sweetness receptor probe and a tool for further eluci- 
dation of the taste response. 

A plentiful supply of pure thauinatin is re- 
quired to utilize the protein as a possible food additive 
10 and research tool« Because the thauinatin plant requires 
a tropical climate and insect pollination for successful 
fruit propagation, there are considerable difficulties 
involved in greenhouse cultivation of the fruit. 

Tbe entire amino acid sequence of thaumatin 
15 I has been determined directly and is set out below 
[Iyengar, et al. , Eur.J.Biochem. , 96 ^ 193-204 (1979)1. 

1 10 
NH^-Ala-Thr-Phe-Glu-Ile-Val-Asn-Arg-Cys-Ser-Tyr-Thr-Val- 

20 

20 Trp-Ala-Ala-Ala-Ser-Lys-Gly-Asp-Ala-Ala-Leu-Asp-Ala-Gly- 

30 40 
Gly-Arg-Gln-Leu-Asn-Ser-Gly-Glu-Ser-Trp-Thr-Ile-Asn-Val- 

50 

61u-Pro-Gly-Thr-Asn-Gly-Gly-Lys-Ile-Trp-Ala-Arg-Thr-Asp- 

25 

60 

Cys-Tyr-Phe-Asp-Asp-Ser-Gly-Ser-Gly-Ile-Cys-Lys-Thr-Gly- 

- 70 80 
Asp-Cys-Gly-Gly-Leu-Leu-Arg-Cys-Lys-Arg-Phe-Gly-Arg-Pro- 

90 

30 Pro-Thr-Thr-Leu-Ala-Glu-Phe-Ser-Leu-Asn-Gln-Tyr-Gly-Lys- 

100 110 
Asp-Tyr-Ile-Asp-Ile-Ser-Asn-Ile-Lys-Gly-Phe-Asn-Val-Pro- 

120 

Met-Asn-Phe-Ser-Pro-Thr-Thr-Arg-Gly-cys-Arg-Gly-Val-Arg- 

35 130 

Cys-Ala-Ala-Asp-Ile-Val-Gly-Gln-Cys-Pro-Ala-Lys-Leu-Lys- 
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140 



150 



5 

4 



10 



15 



20 



25 



30 



Ala-Pro--Gly-Gly-Gly-Cys-Asn-Asp-Ala-Cys-Thr-Val-Phe-Gln 



Thr-Ser-Glu-Tyr-Cys-<:ys-Thr-Thr-Gly-Lys-Cys-Gly-Pro-Thr- 

170 180 
Glu-Tyr-Ser-Arg-Phe-Phe-Lys-Arg-Leu-Cys-Pro-Asp-Ala-Phe- 



Ser-Tyr-Val-Leu-Asp-Lys-Pro-Thr-Thr-Val-Thr-Cys-Pro-Gly 



Ser-Ser-Asn-Tyr-Arg-Val-Thr-Phe-Cys-Pro-Thr-Ala-COOH 

The amino acid sequence for tbauinatin II has 
been deduced from its nucleotide sequence [Edens^ et al* . 
Gene , 18^ 1-12 (1982)] and a gene for thaumatin II has 
been cloned from messenger RNA-derived cDNA. The five 
amino acids in the thaumatin II sequence which differ 
from the thaumatin I sequence above are the following: 
lysine instead of asparagine at residue 46; arginine 
instead of serine at residue 63; arginine instead of 
lysine at residue 67; glutamine instead of arginine at 
residue 76; and aspartic acid instead of asparagine at 
residue 113. Sequence analysis also . indicated that 
thaumatin II is initially translated as a precursor form, 
preprothaumatin, with both a 22 residue araino-terminal 
extension and an acidic, six-amino acid carboxy terminal 
tail. The amino terminal peptide was postulated as a 
secretion signal based on its hydrophobic character and 
a compartmentalization role was hypothesized for the 
carboxy terminal extension. 

The Edens, et al. reference cited above notes 
that a polypeptide having the native sequence of prepro- 
thaumatin II has been microbially produced. More specif- 
ically, the reference and European Patent Application 
Nos. 54,330 and 54,331 disclose cDNA sequences coding 
for native mature thaumatin II and pr prothaumatin II 
and also disclose cloning vehicles comprising the DNA 



160 
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sec[uences for use in transformation in microorganisms. 



scribed in European Patent Application Nos. 54,330 and 
54,331 have a number of disadvantages. Restriction 
5 enzyme sites occur randomly and may not allow further 
manipulations of the gene such as are involved in vector 
construction. Generation of cDNA clones by the method 
of Villa-Komaroff , et al. , [ PNAS-OSA , 75, 3727-3731 
(1978) } introduces a homopolymer of GC base pairs 

10 upstream from the gene, which may cause difficulty in 

attaining efficient production of the polypeptide. Host 
microorganisms seldom possess the physiological apparatus 
for processing protein products in the same way as cells 
from which the DNA is derived. In the case of cDNA genes 

15 coding for precursor polypeptides like preprothaumatin, 
it is necessary to remove initial and terminal DNA se- 
quences and introduce microbial transcription initiating 
and translation terminating codons if the protein is to 
be produced in its mature form. 

20 The most important problem encountered by 

employing cDNA methods may be the incompatibility between 
the codons extant in the natural gene and those commonly 
preferred by the expression host. This incompatibility 
can prevent efficient translation of mRNA in the host. 

25 This may especially be true in the cas^ of thaumatin, 

where a comparison of the codons extant in the thaumatin 
II gene [Edens, et al., supra ] and those apparently pre- 
ferred by yeast [Bennetzen, et al. , J.Biol ,Chem. , 257, 
3026-3031 (1982)] and E.coli [Grosjean, et al. . Gene , 

30 18, 199-209 (1982)] show a marked divergence. 



use of cDNA-derived genes for microbial expression of 
desired polypeptides can be avoided by direct chemical 
synthesis of the gene. While total synthesis of large 
35 structural genes has been reported occasionally [see, 
Edge^ et al. . Nature, 292 , 756-762 (1981)1, synthetic 



Genes isolated by the cDNA method such as de- 



The above-noted difficulties inherent in the 
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methods have not yet been brouqht to bear in the question 
of genes codinq for thauinatin I and thaumatin II- 

BRIEF SUMMARY 

5 Provided by the present invention are manufac- 

tured genes capable of directing synthesis in a selected 
host microorganism of a polypeptide having one or more 
of the biochemical or immunological properties of 
thaumatin, e.g., thaumatin I and II. In preferred forms 

10 of manufactured genes, the base sequences include one 
or more codons selected from among alternative codons 
specifying the same amino acid on the basis of preferen- 
tial expression characteristics of the codon in a pro- 
jected host microorganism, e.g., E.coli , S.cerevisiae . 

15 Other preferred forms of manufactured genes include 

those wherein: (1) a codon specifies an additional amino 
acid in the polypeptide synthesized which facilitates 
direct expression in E.coli organisms (e.g., an initial 
methionine residue) and/or yeast organisms; and/or (2) 

20 codons specifying thaumatin polypeptides are preceded 
and/or followed by a sequence of bases comprising a 
portion of a base sequence which provides for restriction 
endonuclease cleavage of a DNA sequence (e.g., a BamH I 
site) and consequently facilitates formation of expres- 

25 sion vectors. 

Also provided by the present invention are 
fusion genes comprising manufactured genes according to 
the invention fused to a second gene capable of directing 
synthesis of a second polypeptide (e.g., L-ribulokinase) 

30 in a manner permitting the synthesis of a fused polypep- 
tide including a thaumatin polypeptide. 

In practice of the invention to generate poly- 
peptide products. DNA sequences including manufactured 
genes are inserted into a viral or circular plasmid DNA 

35 



/A^ wi?o 
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vector to form a hybrid vector and the hybrid vectors 
are employed to transform host microorganisms such as 
bacteria (e^g., E.coli ) or yeast cells (e.g.r 

S.cerevisiae ) . Vectors may also be supplied with appro- * 
5 priate promoter/regulator DNA sequences, allowing for 
controlled expression in the host microorganism. The 
transformed microorganisms are thereafter grown under 
appropriate nutrient conditions and express the polypep- 
tide products of the invention. 

10 As a further aspect of the invention, there 

are provided improved processes for selectively altering 
the nucleotide sequence of a plasmid DNA sequence having, 
respectively, at least two unique restriction or at least 
three unique restriction endonuclease recognition sites, 

15 by employing a primer oligonucleotide containing a 

selected alteration. The novel processes were developed 
in the course of developing the manufactured gene of the 
present invention. 

Other aspects and advantages of the present 

20 invention will be apparent upon consideration of the 
following detailed description thereof. 

DETAILED DESCRIPTION 

25 As employed herein, the term "manufactured" 

as applied to a DNA sequence or gene shall designate a 
product either totally chemically and enzymatically 
synthesized by assembly of nucleotide bases or derived 
from the biological replication of a product thus synthe- 

30 sized. As such, the term is exclusive of products * 
"synthesized" by cDNA methods or genomic cloning method- 
ologies which involve starting materials which are ' 
initially of biological origin. 

The following abbreviations shall be employed 

3^ herein to designate amino acids: Alanine, Ala; Arginine, 
Arg; Asparagine, Asn; Aspartic acid. Asp; Cysteine, Cys; 
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Glutamine, Gin; Glutamic acid, Glu; Glycine, Gly; Histi- 
dine. His; Isoleucine, He; Leucine, Leu; Lysine, Lys; 
Methionine, Met; Phenylalanine, Phe; Proline, Pro; 
Serine, Ser; Threonine, Thr; Tryptophan,. Trp; Tyrosine, 
5 Tyr; Valine, Val. The following abbreviations shall be 
employed for nucleotide bases: A for adenine; G for 
guanine; T for thymine; U for uracil; and C for cytosine. 

For ease of understanding of the present 
invention. Table I below provides a tabular correlation 
10 between the 64 alternate triplet nucleotide base codons 
of DNA and the 20 amino acids and transcription termina- 
tion ("stop") function specified thereby. 

TABLE I 



15 



25 



30 



35 



FIRST 
POSITION 




SECOND 


POSITION 




THIRD 
POSITION 




T 


C 


A 


G 




T 


Phe 
Phe 
Leu 
Leu 


Ser 
Ser 
Ser 
Ser 


Tyr 
Tyr 
Stop 
Stop 


Cys 
Cys 
Stop 
Trp 


T 
C 
A 
G 


C 


Leu 
Leu 
Leu 
Leu 


• Pro 
Pro 
Pro 
Pro 


His 
His 
Gin 
Gin 


Arg 
Arg 
Arg 
Arg 


T . 
C 
A 
G 


A 


He 
He 
He 
Met 


Thr 
Thr 
Thr 
Hbr 


Asn 
Asn 
Lys 
Lys 


Ser 
Ser 
Arg 
Arg 


T 
C 
A 
G 


6 


Val 
val 
val 
Val 


Ala 
Ala 
Ala 
Ala 


Asp 
Asp 
Glu 
Glu 


Gly 
Gly 
Gly 
Gly 


T 
C 
A 
6 
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The following example illustrates a preferred 
general procedure for. preparation and assembly of deoxy- 
oligonucleotides for use in the manufacture of DNA 
sequences of the invention. 

5 

EXAMPLE 1 

An entire synthetic gene for thautaatin I was 
assembled through use of 28 principal oligonucleotide 

10 fragments (designated la, lb, 2a, 2b, etc., in Table II, 
below) having a length of 25 to 39 residues and designed 
to encode the amino acid sequence of thaumatin I pub- 
lished by Iyengar, et al. , supra . The gene contained 
codons corresponding to the "favored triplets" of 

15 s.cerevisiae based on the Bennetzen, et al. , supra , 
study of highly expressed yeast gene sequences. 

TABLE II 



20 la 5'-(CCAG) TGATC ATG GCT ACC TTC GAA ATC GTT A-3* 

lb 3'-CTT TAG CAA TTG TCT ACA AGA ATG TGA CAA A-5' 

2a 5'-GG GCT GCT GCT TCC AAG GGT GAC GCT G-3' 

2b 3'CA CTG CGA CGA AAC CTG CTCGAG (TTCC) -5 ' 
**** 

3a 5'-(CCTT)GAGCTC C GGT GGT AGA CAA TTG A-3 ' 
25 3b 3'-CT GTT AAC TTG AGA CCA CTT AGG AC-5' 

4a 5'-G ACC ATC AAC GTC GAA CCA GGT AAC AA-3 ' 
4b 3'-T CCA TTG TTG CCA CCA TTC TAG A (TTCC) -5* 
5a 5'-(CCTT) AG ATC TGG GCT AGA ACC GAC TGT TAC TTC 
GAT G-3 ' 

30 5b 3'-TG AAG CTA CTG AGA CCA AGG CCA TAG ACA TTC TGA 
CC-5' 

6a 5 »-T GAC TGT GGT GGT TTG TTG AGA TGT AAG AGA TTC 
GGT-3 ' 

6b 3 '-TCT AAG CCA TCT GGT GGT TGG TGA AAC CGA CTT AAG 
35 (TCT) -5' 



OJ/IPI 
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7a 5'-{GCT) GAA TTC TCT TTG AAC CAA TAG GGT AAG GAC-3 ' 

7 b 3 ' -CCA TTG CTG ATG TAG CTA TAG AGG TTG TAG TTG-5 ' 

8a 5' -GGT TTC AAC GTT CCA ATG AAC TTC TCT CCA ACC-3' 

**** 

8b 3'-AGA GGT TGG GCA TCT CCA ACA TCT CCTCGAG{TTCC) -5 ' 
5 9a 5"-(TTCC)GAGCTC GTC AGA TGT GCT GCT GAC ATC-3 ' 
9 b 3' -CTG TAG CAA CCA GTT ACA GGT CGA TTC GAA 
(CCCTT) -5 ' 

10a 5*-(TTCCT) AAG CTT AAG GCT CCA GGT GGT GGT TGT 
AAC-3' 

10 10b 3 "-CCA ACA TTG CTG CGA ACA TGG CAA AAG GTT TGA-5 ' 
11a 5 '-TCC GAA TAC TGT TGT ACC ACT GGT AAG TGT GGT-3' 
lib 3' -TTC ACA CCA GGT TGG CTT ATC AGA TCT (CCTT)-5' 
12a 5'-(TTCC) TCT AGA TTC TTC AAG AGA TTG TG-3* 
12b 3'-C TCT AAC ACA GGT CTG CTCGAG (TTC) -5 ' 

15 13a 5»-(TTCC)GAGCTC T TTC TCC TAC GTC TTG GAC AA-3« 
13b 3'-G AAC CTG TTC GGT TGA TGG CAG TGA ACA GGT-5« 
14a 5' -GGT TCT TCC AAC TAC AGA GTT ACC TTC TCT C-3» 
14b 3'-GG AAG ACA GGT TGA CGG ATT ACT GAG CTC (CCTT) -5 ■ 

20 As often as possible, codons were employed to 

introduce unique restriction enzyme recognition sites 
at regular intervals into the sequence. In two cases, 
rarely used codons were required to form recognition 
sites at desired locations, i.e., the codon for glycine 

25 at residue 123 in Table VI, infra , is GGC, rather than 
the "favored triplet" GGT, and the codon for leucine at 
residue 138. is CTT, rather than the "favored triplet" 
TTG. As an aid to assembly of the entire gene from 
intermediate double-stranded sequences, oligonucleotides 

30 were designed to allow for the presence in intermediate 
structures of three SstI sites within (and interruptive 
of) the protein coding region. The "extra" bases needed 
to develop these sites are within parentheses and noted 
by asterisks in Table II. As discussed infra , these 

35 bases are deleted from the gene in the course of final 
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assembl/. Additional bases within parentheses in Table 
II are included to insure efficient digestion of 
duplexes formed and are not part of the intermediate 
one-pair or two-pair segments or the final gene, 

5 The 28 oligonucleotides corresponding to 

regions of the thaumatin I sequence were synthesized by 
the phosphotriester method on a solid support, with 
dimer couplings, and purified by high performance liquid 
chromatography according to Ito, et al. . Nucleic Acids 

° Res. , 10, 1755-1769 (1982)* In some cases the Ito, 
et al. process was modified to include initiating the 
assembly of each oligonucleotide from 30 mg nucleoside- 
bound (0.10-0.17 mmole/g) polystyrene resin (1% cross- 
linked) rather than the usual 50-60 mg resin. The aver- 

5 age yield of oligonucleotide per dimer coupling was 90% 
to 99%. 

The following example illustrates the manipu- 
lations performed on the oligonucleotides to assemble 
them into duplexes and combine duplexes into gene seg- 
° ments for cloning into a vector. 

EXAMPLE 2 

As summarized in Table III, below, the 28 
^ oligonucleotides synthesized in Example 1 were assembled 
into partial duplexes containing a sense and antisense 
strand with a 9 or 10 base pair overlap at the 3' termi- 
nus of each oligonucleotide. 
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The oligonucleotide of each pair which was to 

participate in a blunt-end ligation was 5*-end labeled^ 

Labeling was performed with [ P] transfer from 
32 

[y- PjAu^ oy T4 polynucleotide kinase according to the 
5 procedure of Sgaramella, et al. , J,Mol,Bio, ^ 72, 427-444 
. (1972) and complete phosphorylation was effected by 
subsequent ATP (0.5 mm) addition. All oligonucleotide 
pairs were labeled in this fashion except oligonucleotide 
pairs 9 and 12. These oligonucleotide pairs were subse- 
10 quently to be digested at both ends, so prior 5' phos- 
phorylation would not facilitate visualization of a 
doubly-digested duplex* For these oligonucleotide pairs 
radiolabeling was performed with [a-^^PldATP during the 
elongation. 

15 The labeled oligonucleotide was mixed with an 

equal amount of its unlabeled partner (50-200 ng) , heated 
to 90**C for one minute and slowly cooled to 23*»C to allow 
proper annealing. Oligonucleotides were extended at 
23**C for one hour with one unit DNA polymerase I (Klenow) , 
20 or 24 units reverse transcriptase and 0.5 mM deoxynucle- 
oside triphosphates. The enzyme was subsequently dena- 
tured and the mixture desalted. 

Each oligonucleotide pair was digested over- 
night with the appropriate restriction endonuclease and 
25 the enzyme denatured. iUialysis of the digestion mixture 
was carried out by autoradiography of a nondenaturing 
15% polyacrylamide gel and the desired band excised. 
This fragment was eluted from the gel by diffusion at 
37**C overnight and concentrated and further purified by 
passage through a BND-cellulose mini-column [Rossi, J., 
et al , , J.Mol.Biol. , 128 , 21-47 (1979)]. 

For oligonucleotide pairs 8a plus 8b, 9a plus 
9b, ,13a plus 13b, and 14a plus 14b, self -annealing of 
fragments was observed. Therefore, to direct the proper 
annealing, a third "c" oligonucleotide was synthesized 



30 



35 
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which was complementary to a region of one of the oligo- 
nucleotides and contiguous with, the overlap region. 
This "c" oligonucleotide acted as an extension of the 
overlap region and was elongated to produce the desired 
5 duplex. 

In cases where a "c" oligonucleotide was re- 
quired, the above protocol with several modifications 
was followed. The scale for duplex synthesis was 
increased due to the lower yields. 250 ng to 1 ng of 

10 each oligonucleotide (a and b) was used and a two- to 
six-fold molar excess of the phosphorylated "c" oligo- 
nucleotide was added. The three oligonucleotides were 
mixed after labeling and phosphorylation of the appro- 
priate oligonucleotide (s) , denatured, annealed and 

15 extended as before. After restriction enzyme digestion 
the duplex was purified by 15% polyacrylamide gel elec- 
trophoresis, but at a very low current to prevent melting 
of the extended "c" oligonucleotide from the duplex. 
After digestion this oligonucleotide is bound to the 

20 duplex by the hydrogen bonding of 15 base pairs (duplexes 
9 and 14) , 17 base pairs (duplex 8) or 23 base pairs 
(duplex 13) . Polymerization of approximately 30% of the 
labeled oligonucleotide was achieved thereby. 

As indicated in Table IV, below, extended and 

25 digested duplexes 9 (from oligonucleotides 9a and 9b) 
and 12 (from oligonucleotides 12a and 12b) forming a 
one-duplex segment had restriction endonuclease enzyme 
sticky ends at both termini. The extended duplexes 
designed to form a two-duplex segment (duplexes 1 and 

30 2, 3 and 4, 5 and 6, 7 and 8, 10 and 11, 13 and 14) had 

a sticky end at one terminus and a blunt end at the other. 
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The following example is directed to th prepa- 
ration of three intermediate vectors, each containing 
a sub-assembly portion of a completely assembled manufac- 
tured gene for thaumatin I. 

5 

EXAMPLE 3 

Three portions of the synthetic thaumatin gene 
were constructed concurrently in three different pBR322 

10 derived vectors. Portion A consisted of extended 

duplexes 1 and 2, 3 and 4, and 5 and 6; Portion B con- 
sisted of extended duplexes 7 and 8, 9 and 10 and 11; 
and Portion C consisted of duplexes 12 and 13 and 14. 
Plasmid plNG233 was designed to receive Portion A, plas- 

15 mid PING235 to receive Portion B, and plasmid plNG237 

to receive Portion C. As shown in Table V below, during 
construction of these plasmids new restriction enzyme 
sites were introduced into plasmid vector pBR322 by blunt 
end ligation of synthetic undecamer (11-mer) linkers and 

20 one commercially available 8-mer linker into existing 
restriction sites [Maniatis, et al, . Cell , 15, 687-701 
(1978)1. 
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Duplexes prepared as described in Examples 1 
and 2 were ligated into the appropriate vector to pro- 
duce a segment. The two segments formed by only one 
duplex, i.e., duplex 9 and duplex 12, required double 
5 digestion of the duplex and ligation to the cohesive 

termini of the vector. The six segments formed by liga- 
tion of two duplexes, i.e., formed by duplexes 1 and 2, 
3 and 4, 5 and 6, 7 and 3r 10 and 11, 13 and 14, were 
created by sticky-end ligation of each cleaved duplex 
10 to the sticky ended vector and blunt end ligation to 
each other. 

Plasmid pING233 (0.4 pmoles) was digested with 
restriction endonuclease enzymes Bgl ll and Sst I and 
dephosphorylated with calf intestinal alkaline phospha- 

15 tase^ The segment formed by purified duplexes 3 and 4 
(1 picomole each) were digested with the same enzymes 
and mixed with the vector. Ligation was carried out at 
12*C overnight with 400 units of T4 DNA ligase. This 
ligation mixture was used to transform E.coli HBlOl 

20 cells. In a similar manner, the segments formed by 

duplexes 5 and 6 and 1 and 2, respectively, were sequen- 
tially inserted into the cloning vector containing du- 
plexes 3 and 4, so that the resulting vector, pING249, 
contained Portion A of the synthetic thaumatin gene. 

25 Plasmid pING235 was manipulated in a similar 

manner to insert the segments formed by duplexes 7 and 
8, and 10 and 11 of Portion B of the gene. However, to 
clone duplex 9, further modifications in the usual pro- 
tocol were required. Polymerized duplex 9 could not be 

30 digested with Hindlll and thus was ligated as a duplex 
with one sticky and one blunt end after Sst I digestion. 
The plasmid pING235 was digested with Hind III, filled 
in with DNA polymerase (Klenow) and digested with Sst I 
to provide compatible ends for ligation of duplex 9* 

35 Thereafter the duplexes 10 and 11 and 7 and 8, respec- 
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tively/ were inserted to complete Portion B and form the 
resulting vector pIN6268. 

Plasmid plNG237, into which Portion C of the 
gene was to be inserted, also required modification of 
5 the usual protocol to insert the segment formed by du- 
plexes 13 and 14. Following digestion of duplex 12 and 
PING237 with Xba l and Sst I and ligation of duplex 12 to 
the cohesive termini of the vector, attempts were made 
to insert the segment formed by duplexes 13 and 14, 

10 Initial attempts to clone the duplexes 13 and 14 which 
had been cleaved with Sst I and Xho l, respectively, 
yielded a high frequency of recombinants, all of which 
contained only duplex 13, sticky-end ligated at the Sst I 
site and blunt-end ligated near the Xho l site of the 

15 vector. By inserting duplex 14 as a doubly-phosphoryl- 
ated, blunt-end duplex, no Xho l nuclease-sensitive over- 
hang was present. The vector was digested with Sst I/ 
PvuII and dephosphorylated; duplex 13 was not phosphoryl- 
ated. Therefore, the vector could not reclose without 

20 the phosphorylated duplex 14. The vector containing all 
of the Portion C was named pING277. 

The following description is directed to the 
screening procedures employed to ensure that proper 

25 ligation events occurred in assembly of plasmids pING256, 
PING270 and pING285. 

To detect inversions and deletions at the point 
of blunt-ended ligation, a "junction-region" probe was 
employed for use in colony hybridization. Because the 

30 percentage destabilization produced by one base mismatch 
is very great and even single base deletions at the 
junction can be discerned very easily with a short probe 
[Wallace, et al. . Nucleic Acids Research , 6, 3543-3557 
(1979)], an undecamer was designed for each s t which 

35 spanned the junction and was therefore complementary to 
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the end of each contributor to the blunt-ended ligation » 
A probe of this length hybridizes very strongly to the 
correctly ligated product. 

Colony hybridization was performed according 
5 to the procedures of Grunstein, et al. , PNAS-USA r 12 , 
3961-3965 (1975) and Wallace, et alw Nucleic Acids 
Research, 9, 879-894 (1981) with the following modifica- 
tions. Filters were prebybridized with lOx Denhardt's, 
4x or Ix SSC, 0.5% Triton X-100 and 100 iig/ml herring 

10 sperm DNA at 60^C for two hours. Synthetic junction 

32 

probes were radioactively labeled with {y- P]ATP as 
described in Sgaramella, supra . Undecamers were sepa- 
rated from unincorporated label by polyacrylamide gel 
electrophoresis. Gene fragments (25- to 39-mers) were 

15 centrifuged through a P-10 column to remove unreacted 
ATP. Hybridization with undecamer probes was performed 
at 22**C in 4x SSC for 2 to 12 hours. The filters were 
subsequently washed for ten minutes in 4x SSC at the 
same temperature three times. Gene fragments were 

20 hybridized at 50®C in Ix SSC and washed in Ix SSC at 
40®C/ for the same intervals as the undecamers. 

The following description is directed to 
sequencing of the vectors. 

25 Because the cloning of synthetic segments into 

each vector was a sequential process, the newly-inserted 
duplexes were sequenced to avoid error accumulation 
before integration of further duplexes into the same 
vector. Dideoxy chain termination sequencing was per- 

30 formed directly from linearized plasmid DNA as described 
in Sanger, et al. , PNAS-OSA , 74, 5463-67 (1977) and 
Wallace, et al. . Nucleic Acids Research , 19, 879-894 
(1981) , using synthetic oligomers complementary to por- 
tions of PBR322 or gene fragments as primers. Plasmid 

35 DNA for sequencing was prepared by a mini-lysate protocol 
[Holmes, et al. , Anal.Biochem. , 114 , 193-197 (1981)] and 
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rapidly purified on an RPC5 analog mini-column to remove 
small oligonucleotide primers [Thompson r et al. » Methods 
in Enzymoloqy r lOO r 368-399 (1983)]. With pBR322 oligo- 
nucleotides (13-mers) and thaumatin oligonucleotides 
5 (25- to 39-mers) as primers r the reactions were performed 
at 30*C and 37**C, respectively. 



colonies determined that igreater than 60% of these clones 
contained a deletion r insertion or base change. Single 
10 nucleotide changes were most frequent and (in 90% of the 
cases) consisted of G to A transitions on the synthesized 
strand, possibly caused by dimer impurity or incomplete 
deblocking of G residues before annealing and polymeriza- 
tion. 

15 The following example is directed to improved 

methods of primer directed mutagenesis employing a 
linearized single-stranded plasmid template to correct 
the several single base changes noted above. 

20 EXAMPLE 4 



the nucleotide sequence of a double-stranded plasmid DNA 
sequence having at least two unique restriction endonu- 

25 clease enzyme recognition sites was developed during the 
course of the construction of the thaumatin genes, and 
involves annealing a single-stranded primer oligonucle- 
otide containing a selected alteration to one strand of 
the double-stranded DNA sequence and extending the primer 

30 to form a partial complement thereof. The improvement 
in this mutagenesis process comprises the steps of: 

(a) linearizing the double-stranded plasmid 
DNA sequence by restriction endonuclease enzyme digestion 
at a first unique recognition site in the sequence; 



DNA sequ nee formed in step (a) into two complementary 



Sequence analysis of hybridization-positive 



An improved process for selectively altering 



(b) denaturing the linearized double-stranded 
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linear single-stranded DNA sequences; 

Cc) annealing the primer to one of the linear 
single-stranded sequences formed in step (b) and ex- 
tending the primer to form a partially double-stranded 
5 DNA sequence; 

(d) denaturing this partially double-stranded 
sequence formed in step (c) into the original plasmid- 
derived single-stranded sequence and a primer-derived 
single-stranded DNA sequence; 
IQ (e) linearizing the double-stranded plasmid 

DNA sequence by restriction endonuclease enzyme digestion 
at a second unique recognition site in the sequence; 

(f) denaturing the linear sequence formed in 
step (e) into two complementary single-stranded DNA 

15 sequences; 

(g) annealing the primer-derived single- 
stranded DNA sequence formed in step (d) to a plasmid- 
derived complementary single-stranded sequence formed 
in step (f ) ; 

20 (h) recircularizing the annealed strands of 

step (g) into a double-stranded DNA plasmid with an 
alteration in one strand; 

(i) transforming a host microorganism with the 
plasmid formed in step (h) and isolating daughter cell 

25 populations containing plasmids with the selectively 
altered sequence by hybridization with the primer; and 

(j) transforming a host microorgaiiism with the 
selectively altered plasmids obtained from the hybridiza- 
tion of step (i) . . 

30 As one example r this complementary strand 

mutagenesis was employed to cause an A to C transversion 
in the codon specifying threonine at position number 154 
of the thaumatin I sequence (see Table VI ^ infra). The 
plasmid manipulated contains duplexes 9 to 11 between 

35 the Sst I and Xba l sites of pBR322 derivative pING235. 
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After digestion of the super-coiled plasmid with a 
restriction endonuclease enzyme , PstI, one microgram of 
the now-linearized plasmid was mixed with a twenty-fold 
molar excess of a phosphoryla ted mutagenic primer having 
5 the correct sequence, 5 ' -CAAA^TTCCGA-3 ' . In 25 |il poly- 
merization buffer the linearized double-stranded plasmid 
was denatured and the primer reannealed as follows. The 
mixture was sealed in a capillary tube, incubated at 
100**C for three minutes and plunged into an ice water 

10 bath for several minutes. Prior to polymerization the 
mixture was pre-incubated at the reaction temperature, 
12**C, for ten minutes. 

Primer elongation was carried out in 0.5 mM 
deoxynucleoside triphosphates, 5 mM DTT and 1 unit DNA 

15 Polymerase I, Klenow Fragment at 12*C for thirty minutes 
followed by 37^C for two hours. The mixture was then 
heated at lOO^C for one minute to denature the polymerase 
and mixed with pING235, linearized by digestion with 
1 Ava l and dephosphorylated. The reaction mixture 

20 was diluted to 100 |il, sealed in a capillary tube, heated 
to 100**C for three minutes and .incubated at 60*C for two 
hours to reanneal the extended primer to a complementary 
strand. The desired hybrid contains different locations 
for the restriction endonuclease enzyme recognition site 

25 at which each strand was cleaved; thus, when completely 
annealed, a circular structure is formed. The mixture 
was desalted by centrif ugation through PIO, lyophilized 
and dissolved in 20 [il polymerase buffer. A complete, 
circular duplex was created by incubation with 400 units 

30 T4 DNA ligase, 0.5 mM deoxynucleoside triphosphates, 

0.5 mM ATP and 1 unit DNA Polymerase I, Klenow Fragment 
overnight at 12'*C. After transformation of E.coli HBlOl, 
colonies were screened by hybridization using the primer 
as probe. 

An alternative improved process was developed 
for sel ctively altering the nucleotid sequence of a 
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double-Stranded plasmld DNA sequence having first r second 
and third unique restriction endonuclease enzyme recogni- 
tion sites, wherein a single.-stranded primer oligonucle- 
otide containing the selected alteration is annealed to 
5 one strand of the double-stranded DNA sequence and ex- 
tended to form a partial complement thereof, and the 
plasmid DNA sequence to be altered is between the second 
and third recognition sites. The improvement coinprises 
the steps. of: 

10 (a) linearizing the double-stranded plasmid 

DNA sequence by restriction endonuclease enzyme digestion 
at the first recognition site in the sequence; 

(b) denaturing the linearized double-stranded 
DNA sequence formed in step (a) into two complementary 

15 linear single-stranded DNA sequences; 

(c) annealing the primer to one of the linear 
single-stranded sequences formed in step (b) and 
annealing to the same linear sequence a second primer 
complementary to a portion of the sequence between the 

20 first and second recognition sites and extending the 

primers to form a partially double-stranded DNA sequence 
including both the second and third recognition sites;: 

(d) cleaving the partially double-stranded 
sequence formed in step (c) by restriction endonuclease 

25 digestion at the second and third restriction sites to 
form a double-stranded sequence which is a hybrid of a 
plasmid single strand and the primer extended strands; 

(e) inserting the fragment formed in step (d) 
into the DNA plasmid to form a double-stranded plasmid 

30 including the alteration in one of its strands; 

(f) transforming a host microorganism with the 
plasmid formed. in step (e) and isolating daughter cell 
populations containing double-stranded plasmids with the 
selectively altered sequence by hybridization with the 

3^ primer; and 
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(g) transforming a host microorganism with the 
selectively altered plasmids obtained. from the hybridiza- 
tion of step (f ) • 

As one example, this fragment excision/ 
5 religation mutagenesis technique was employed to correct 
a G to A transition in the center of the codon specifying 
glycine at position number 144 of the thaumatin I gene. 
The second primer was employed upstream from the 
mutagenic primer, which, when elongated, reformed the 

10 recognition site. One microgram of super-coiled plasmid 
containing duplexes 7-11 inserted between the EcoRI and 
Xba l sites of pING235 was digested to completion with 
restriction endonuclease enzyme Pstl. This plasmid was 
combined with 30 ng (6 praole) each: (a) phosphorylated 

15 mutagenic primer containing the alteration, 
5'-CAACCA^CACC-3'; and (b) a pBR322 primer, 
5 ' -GTTGAAG6CTCTC-3 • / which is complementary to the 
sequence adjacent to the Sai l site and primes counter- 
clockwise. The mixture was denatured and the primers 

20 annealed as described above for the complementary strand 
mutagenesis. Prior to polymerization and ligation the 
solution was pre-incubated at the reaction temperature 
(12*0 for five minutes and then the solution was 
adjusted as follows: 0.5 mM in each dNTP, 5 mM DTT, 1 

25 unit DNA Polymerase I, Klenow Fragment, 0.5 mM in ATP, 
and 400 units T4 DNA ligase. The mixture was incubated 
at 12®C overnight, the enzymes denatured and the restric- 
tion enzymes Xba l and EcoR I used to excise a portion of 
the newly-copied region. This was mixed with the equiv- 
alently digested, dephosphorylated pING235-der ivative 
plasmid containing duplexes 7-11 and ligated at 12®C 
overnight. The mixture was used to transform HBlOl and 
ampicillin-resistant colonies were screened by colony 
hybridization using the primer containing the alteration 
as the probe. 
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The following example is directed to the final 
assembly of the gene from the corrected portions A, B 
and C of plasmids pING249, plNG268 and pING277. 

5 EXAMPLE 5 

After correction of all the nucleotide errors 
by ^n vitro mutagenesis in Example 4, the Sst I site 
(5'-GAGCTC-3') of each completed gene portion was 

10 destroyed fay removal of the AGCT sequence. Bach plasmid 
was digested to completion with Sst I and adjusted to 
0.5 mM in each deoxynucleotide triphosphate. Removal 
of the 3' single-stranded end was accomplished by the 
addition of 1 unit of DNA Polymerase I (Klenow) and 

15 incubation of the mixture at 22^C for 15 minutes. T4 
DNA ligase was then added (400 units) and the solution 
placed at 12**C overnight. The ligation mixture was used 
to transform HBlOl and ampicillin-resistant colonies 
were screened for the Sst I site. Following these proce- 

20 dures/ the resulting plasmids were re-designated. 
PING249 became pING250; pING268 became pING270; and 
PING277 became pING278. 

pniG250 was thereafter digested with Bel l 
enzyme and blunt-ended by SI nuclease. A BaroH I linker 

25 of sequence 5 ' -CGGGATCCCG-3 • (New England Biolabs) was 
inserted between the blunt ends to generate pING256. 
An additional "stop" codon was introduced into pING278 
by primer-directed mutagenesis to generate plNG285. 

The three completed portions of the gene 

30 present in plasmids plNG256, plNG270 and pING285 were 
then combined into one complete gene sequencer as set 
out in Table VI, below. 



wo 85/01746 



PCT/US84/01564 



- 27 - 



(0 



u 

<D 
CQ 

(0 

< 

as 
u 



3 
i-H 

(0 



0) 



ou 
^< 

lis 
gs 

US 
si: 

u a 

ou 
^< 
<^ 





O O 




U C5 




< Eh 


a 


a a 


If 

M 


rn r 1 








Cj o 


(1) 


u a 








si: 


w 


Si* w 


>i 




i-f 


o u 




a o 


M 




01 


u o 


CQ 




C 




CO 

< 


si: 


3 


a o ^ 


0) 






i:S" 


c 

O 


.si: 


en 




cn 










ro 


>i 


&^ <: 


iH 


u o 


O 


a o 


>1 




l-l 


CD U 


o 


o u 




oo 


r-l 


u o 


< 


a o 


Ql 




CQ 




< 






CD U 




i:S 


03 






u CD ia 


<< 


CD U CM 


to 


^ < 


r-f 


O CD 


< 


CD a 




U CD 


CQ 




< 


CD U 






O iH 


CD U 


CM O 


CD U 


CQ 


a cj 


>1 
i4 


si: 





U CD 






&^ 




CO 




>i 


CD U 


U (0 H < 


in 


Oi 


CJ CD 


CO 




< 


CD <J 


u 












at 




u 


CD CJ 


< 


rt: EH 


as 






CJ CD 


< 


CD U 


cu 


CD a 


u 








<u 


us 


O iH 




M LTt M 


< Eh 


M 




CO 


a o 


ai >i 


si: 


>i 


H <; 


M 


CD U 


CD 


CD O 



c 

O CO 



Eh <: 
CD U 
CD a JQ 

CJ CD 

EH 

< E^ 

Si g 

&H < 
CD a 

CD CJ 

< &H 

U CD 
U CD 

< Eh 

< Eh 

CD a 
a CD 

ro Eh 

> m CD u 
a CD 





1 <E^ 




CDO 


< m < Eh 




3 


CD U 






3 


CD CJ 


Al 
U/ 








I-H 


O CJ 


CD 


CD U 


>i 


^ < 


I-H 


CD a 


CD 


CDO 


CO 


&t < 


>i 


CD CJ 




EH < 


CU 


U CD 


CO 




< 


CD O 




< 


I-l 


o a 


CD 


CD a 


M 


EH < 






Eh 


sg 


CO 




>i 
i4 


si: 


to 




>1 




o 





si: 

CJ CD 

H <; 

< Eh 



Of 

I-H 
M 

>1 

CD 

u 
<U 
CQ 

>i 
rH 
CD 

Wl 

CQ 

01 

O QE> 

\o < 

CO 

•< 

JZ 
Pi 



.2J 



<: 

CD KJ 
CD CJi3 

m 

UCD 
O CD 
EH < 

EH < 

CD CJ 
CD O 

^1 < 

u a 
CJ a 

•a; Eh 
CD CJ 

&^ < 

CD CJ 

O CD 

i:S 



wo 85/01746 



PCTAIS84/01564 



- 28 - 



Wt 

a 
o 



U C5 
^ < 



3 

(U 



c 
o 



13 













<u 




CQ 




(U 




O J= 


M 91 CU 






8 






1 - 




O 




<D 




r-f 




Kt: 












a 












Eh 




kt 




JC 




&4 




O 












o 




u 




Qi 




D» 




ki 




< 












o 




o 




ox: 












u 




< 




ca 




>i 




a 




CO 








o 



&4 < 

u o 
frt < 

o a 

&4 < 

< Eh 

< E^ 
CD O 

E^ < 

o a 
a u 

CD o 
E^ < 

EH <: 

Eh .4: 
a CD 

< EH 

a C5 
u c? 

< E* 

< Eh 
O CD 
U CD 

EHf 
O CD 

o a 

< EH 
CD U 

< EH 

E« < 
O U 
CD U 

OCD 
EH < 

EH < 

< Eh 

CD a 

< EH 

II 

E* < 
CD CJ 
EH < 



01 
JZ 

C- 
O 

s 





o 












O i-H 




f 
































>i 








CD 




CQ 












CI 








M 




C 




03 




< 








<D 




CQ 




Of 


> 




M 


M 


OS 




0 




a 


CD 


;^ 


1 < 




o o; 




O f-l 




i-H w 




u 




>1 




EH 




a 




ca 




< 




ca 




>i 








>i 








crx 



Eh 

C5 O 
EH < 
Eh 

GO 

Eh 
CD 
CJ CD 

Eh < 
EH < 
CD O 

o a 

< EH 

< EH 

a o 

Eh < 
CD U 
CD U 

9 y 

< Eh 

< Eh 

O CD 
Eh < 

< ^ 

< Eh 

< EH 

U CD 
U CD 
EH < 

U O 
EH < _ 

< Eh 

Eh < 

< EH 
CD U 

O CD 
Eh <: 
-cC EH 

CJ O 

< EH 

Eh < 



< Eh 
CD O 

CD U 

< Eh 

< Eh 

Eh < 
CD O 
CD KJ 



S 

a 
>t 
a 



III: 

CJO 
Eh -ij 

CD a 

CD CJ 
CD U 



o 01 I a CD 

CO rH CO £h < 
i-l IH 0\ < Eh 



Qi 
CD 
< 

iH 



CO 
CJ 



(0 

> 



CD 

< 

CO 

>1 
u 

O >t 
CS r-H 
.HO 

on 
u 
< 

EH 

u 
JZ 
EH 

o 
w 

Ck 
u 

0 
CQ 



CJ CD 

< EH 

CD a 

Eh 

CJ CD 
CD O 

Eh <: 

CJ a 

CD u 

EH !< 

CD CJ 
EH < 

< Eh 
CD U 

< Eh 

CJ CD 
EH < 
CD O 

O CD 
CD U 
CD U 

< Eh 
CD CJ 

< EH 

Eh <C 
CD CJ 
EH < 



EH 

CD U 

CD a ^ 

CO 

< EH 
CD CJ 

< Eh 

Eh < 
U KJ 

< CD 

CJ CD 
CJ CD 

< EH 

< Eh 
CJ CD 
CJ CD 

Eh < 



> 

o w 

r-H EH 
CO 
CJ 

rH 

CO 

< 

C 

m 
< 

CO 

>i 
CJ 

>1 
f-l 
CD 

>i 

r-l 

CD 
>i 

I-H 

CD 

O 
u 
Oi 

O (Q 
^ iH 

cn 

M »4 

M 

C 01 

CQ 
>i 
>4 

<0 
r-C 
< 

o 

Wl 

CO 

u 



Eh rt! 
Eh < 
CD O 

CJ CD 
O CD 

< EH 

EH ^ 
O CJ 
EH < 

U CDO 
CD U 

CJ O 

< Eh 
CD O 

O CD 

^i: 

Eh < 
CD CJ 
EH < 

EH < 

a CJ 

(0 CD U 



rH Eh < 
CD CJ 
CD CJ 

Eh <d 

o a 

CD CJ 

< Eh 
O CD 

a CD 

Eh .< 
O CD 
CD CJ 

CD O 

CJ CD 
CD U 

^i: 

EH < 

CJ a 

CD CJ 

< EH 

O CD 
CJ O 

Eh < 
CD CJ JQ 
Eh<o\ 



wo 85/01746 PCTAJS84/01564 



- 29 - 



C 

o 
u 



> 



M < 
(0 

o u 

U 

IT 

0 
iH 

u 
o 

u 

P4 
>1 

C5 

CO 

>1 
a 

ca 
>i 
1^ 

>i 

rH 
C3 

O vr 

r-l H 

01 
>i 
CJ 

CQ 
>1 
U 

u 

>i 
&^ 



a 

M 
CQ 



< EH 



o a 
u a 

u a 

Eh icC 

o o 

CD O 

< 
^ < 

III: 

^ < 

O CJ 
O CJ 

^ < 

rH < M 

a o 
&^ 

< 
a CJ 

en < 
a CJ 

CJ K3 

KD CJ 

CJ CD 
CJ C3 



CU 

04 



III: 

CJ a 

CJ CD 



> 

O ki 
w 

O 

u 
Oi 

CO 

>1 

Q* 

00 

1^ 



u 

EH 

u 

<u 

CQ 
<U 

P4 

O 03 
CO i-l 
rH<C 

Q4 

03 
< 

O 

u 

oa 
>i 
CJ 

3 

< 

CO 

>1 

JZ 

04 



U CD 
Eh < 
CD O 

a CD 

CJ CD 

&4 < ja 

CJ CD 
»«a: E^ iH 

< £H 
CJ CD 
CJ CD 

§1^ 

a CD 

< £H 

CD U 

CD U 
(0 Eh itf 
m Eh < 

CD U 

a CD 

< H 

Eh < 

O CD 
CJ CD 

e< 

CJ CD 

< 

CJ CD 
CD U 

CJ CD 

< M 
CD CJ 

CJ a 

CJ CD JQ 
CM 

Eh < 
CD CJ 

CD CJ 

i:^ 

< Eh 
CD CJ 

< Eh 



CD a 

(0 £h 

CN 2 H 
rH 

CJ a 

i:^ 

CJ CD 
EH < 
H < 



(a 

O i-H 

0 

cu 

CO 
>f 
CJ 

o 

P4 
Wl 



O D> 
O W 

w 

>1 

Eh 
C 

xn 
< 

u 

CQ 
U 
CQ 



CD 

O 
u 

CM 

QQ 
>l 
CJ 

u 
JZ 

E4 



Eh 
CJ 

H CD 
01 •< 
i3 O CD 
X 

< EH 
U CJ 
< 



E^ < 

CJ CD 
U CD 
CD U 

E^ i< ^ 
U CD i-H 
!< Eh 

< Eh 
U CD 
U CD 

E^ <: 

CD CJ 
EH < 

U CD 

nil 

CJ CD 
U CD 

< Eh 

CD CJ 

<^ 
CD CJ 

< EH 

O CD 

< EH 

EH < 

c; CD 

CJ CD 
U CD 

EH <: 

Eh < 
CJ CD 

E^ < 
CD CJ 
CD CJ 



< ^ 

a CD 
CJ a 

EH *C 

Eh < 
CJ CD 

< EH 



WO85/01T46 



PCTAJS84/01564 



- 30 - 

Simultaneous integration of two gene portions 
into the vector containing the thirds PING285, was 
accomplished by taking advantage of the requirement for 
an intact ampicillin-resistant gene in viable colonies 
5 and destroying the integrity of this gene on the two 
plasmids donating remaining gene portions. Thereafter, 
plasmids pING256, containing duplexes 1-6, and pING270r 
containing duplexes 7-11/ were linearized with Pst I and 
the resultant 3' single-stranded regions were degraded 

10 by the action of DNA polymerase I (Klenow) • After 
ligation, the ampicillin-resistant gene contained a 
frameshift which prevented expression of an active gene 
product. The gene portions were subsequently excised 
with BamHI and EcoRI for pING256 and EcoRI arid Xba l for 

15 pING270. The vector which accepted the two restriction 
fragments (PING285, which contained oligonucleotide 
duplexes 12-14) was alkaline phosphatase treated to 
prevent closure without the phosphorylated restriction 
fragments from pING256 and pING270. The resulting 

20 plasmid, containing the entire synthetic thaumatin I 
gene, was named pING301. 50% of the colonies isolated 
contained the expected restriction map of the thaumatin 
gene. Sequence analysis demonstrated that the entire 
coding sequence was present. 

25 The following example is directed to the 

construction of a fused gene comprising the entire 
thaumatin coding sequence of plNG301 inserted in the 
proper reading frame immediately following the first 426 
nucleotides of the S.typhlmurium araB gene. 



30 



EXAMPLE 6 



In this construction, plasmid pMH6, a deriva- 
tive of pMH3 [Horwitz, et al. . Gene , 14, 309-319 (1981)], 
was employed. Plasmid pMH6 harbored in E.coli strain 
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MC1061 has been deposited with the American Type Culture 
Collection, 12301 Parklawn Drive, Rockville, Maryland 
20852, under the identifying number ATCC 39450. pMH6 
was digested with Sai l and EcoRI and the Ml3mp9 poly- 
5 linker fragment [Messing, J., in Third Cleveland 

Symposium on Macromolecules; Recombinant DNA , (Walton, 
A., ed.) Elsevier, Amsterdam 143-153 (1981)] was inserted 
into the digested plasmid. In order to direct the inte- 
gration of the thaumatin I gene, an Xho l linker was 

10 ligated into the T4 DNA polymerase filled-in Eco RI site. 
This resulting vector, pING61, was digested at the Sai l 
site of the araB gene, and the single-stranded region 
was filled in with DNA polymerase I (Klenow) , thereby 
blunt-ending that site. After subsequent Xho l digestion, 

15 this vector was ligated with the thaumatin gene derived 
from PING301 ( BamH I-digested at the 5* end of the 
thaumatin I gene, filled in, and Xho l-digested) to form 
plasmid pING307. Competent E.coli strain MC1061 
[Casadaban, et al. , J.Mol.Biol. , 138 , 179-207 (1980)] 

20 cells were used for transformation. Clones containing 
the araB gene of pMH6 fused to. the entire thaumatin I 
gene were obtained and confirmed for precise joining by 
sequence analysis of the fusion junction. 

The following example is directed to an alter- 

25 native vector for obtaining E.coli expression of the 
thaumatin I gene constructed in the foregoing examples. 

EXAMPLE 7 

The commercially available tac promoter 
30 [Russell, et al. . Gene , 20, 231-243 (1982)] was used to 
achieve expression of the thaumatin gene in E.coli . The 
vector used, pDR540, [PL Biochemicals, Milwaukee, Wis.] 
contains a BaraHI site directly after the Shine-Dalgarno 
sequence of the tac promoter, facilitating insertion 
35 and expression of a synthetic gene without a ribosomal 
binding site. Downstream from this site is a galactose 
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kinase g ne, also under control of th tac promoter. 
The Xhol site at the 3* end of the thaumatin gene was 
converted to BamH I by digestion of pING301 with Xhol, 
filling in with DNA polymerase (Klenow) and blunt-end 
ligation with an Xho l linker (Collaborative Research, 
Inc.). The thaumatin gene was removed from the resulting 
vector {pING302) by digestion with BamH I ^ purified from 
the remainder of the plasmid by agarose gel electropho- 
resis and ligated with BamHI-digested pDR540 at 12*»C 
overnight with 400 units T4 DNA ligase to yield plasmid 
PING304. This plasmid was used to transform E.coli 
strain 71.18, an E.coli JMlOl (ATCC 33876) derivative 
with a change in one gene locus. E.coli strain 71.18 
may be produced by employing the standard procedure of 
Miller r J.H., Experiments in Molecular Genetics , Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New York 
(1972). Briefly put, a culture of E.coli cells (ATCC 
33760) which carries the tra gene is infected with 
bacteriophage PI (ATCC e25404-Bl) • The bacteriophage 
then lyses the cells and packages the chromosomal tra 
gene DNA. JMlOl cells are thereafter transduced with 
the tra gene-carrying phage, resulting in E.coli 71.18. 
Isopropyl S-D thiogalactoside (IPTG) induces the tac 
promoter, which is normally repressed in strain 71.18. 
Ampicillin-resistant colonies were screened by colony 
hybridization with the thaumatin gene oligonucleotide 
5b (see Table II) as a probe. Positive colonies were 
screened for the proper thaumatin orientation by mapping 
with EcoRI digestion. 

The following example is directed to determi- 
nation of the stability of the thaumatin I polypeptide 
produced in the expression systems of Examples 6 and 7. 

EXAMPLE 8 

Cultures were grown in minimal medium con- 
taining glycerol (0.2%) as a carbon source, thiamine 
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(2 iig/inl)f and ampicillin (100 v-g/ml) for strain 71.18 
containing pING304 -and- with the addition of leucine 
(0.004%) for strain MC1061 containing pIN6307. 

To perform pulse-labeling experiments with 
5 [^^SI cysteine r cells were grown to an Ag^Q of 0.1 and 
induced with 1% arabinose (MC1061) or 1 mM isopropyl 0-D 
thiogalactoside (71.18). After a thirty-minute incuba- 
tion, 10 liCi (^^SJcysteine or [^^SJmethionine (10 praoles) 
was added. The cells were chilled on ice after the 
10 5-minute desired labeling period, pelleted by centrifuga- 
tion for one minute in a microfuge, and lysed by boiling 

for five minutes in 20 mM Tris-Cl, pH 7.2, IS SDS. 

35 

Pulse-chase experiments with [ S] cysteine to determine 

the stability of synthesized thaumatin in vivo were 

15 performed equivalently, except that unlabeled cysteine 

(50 ng/ml) was added after a five-minute labeling period. 

Cells were harvested at varying times thereafter. 

For MC1061 cells transformed with plasmid 

PING307 including the araB -thaumatin fusion gene of 

20 Example 6, autoradiography of SDS-polyacrylamide gel 

35 

analysis of [ S] cysteine-labeled extracts demonstrated 
the presence of a fusion protein of the expected molecu- 
lar weight, 36 kilodaltons. This protein was not visi- 
ble in the uninduced extracts. Neither a radioimmuno- 

25 assay nor a gel stained with Coomassie Blue were suffi- 
ciently sensitive to reveal the presence of this fusion 
polypeptide. 

In 71.18 cells transformed with plasmid 
pING304 having the thaumatin gene under tac control, 

30 during a five-minute pulse labeling with [^^SJcysteine 
a protein the size of thaumatin was produced. After a 
five-minute chase greater than 90% of the thaumatin had 
been degraded, indicating that this protein is unstable 
in this bacterial strain. Neither uninduced nor 

35 [^^sjmethionine-labeled extracts produce this protein 
at detectable levels. This result is consistent with 
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the frequency of occurrence of these two amino acids in 
thaumatin (16 tinses vs* 1 time). As in the fusion pro- 
tein case, neither a radioinununoassay nor a Coomassie 
Blue stained gel detected the presence of this protein. 
5 The following example is directed to manipula- 

tions performed to confirm the identity of the fusion 
and thaumatin polypeptides, produced in the previous 
examples • 

10 EXAMPLE 9 

Anti-thaumatin antibodies were raised by 
multiple subcutaneous injections of adult male New 
Zealand rabbits with 400 (ig thaumatin in Freund*s Com- 
15 plete Adjuvant, Booster injections were with 80 jig 
thaumatin. The IgG fraction was isolated by ammonium 
sulfate precipitation [Weickman, J.L., et al. , Biochem. , 
20, 1272-1278 (1981)1. 

SDS-lysed, radiolabeled extracts of induced 
20 cells were immunoprecipitated with the above antibody 

preparation as described by Sen, et al. , [ PNAS-PSA , 80 , 

125 

1246-1250 (1983)]. ["^"l] iodothaumatin as a gel standard 

was prepared as described by Weickmann, et al. , [ J.Biol. 

Chem. , 257r 8705-8710 (1982)] for iodinating proteins. 
35 

25 [ S]cysteine-labelled cell extracts were 

immunoprecipitated, electrophoresed through a polyacryl- 
amide gel and autoradiographed. Extracts from pING304 
transformed 71.18 cells (Example 7) showed one protein 
band, identical in mobility to a thaumatin standard. 

30 In contrast, extracts of pING307 transformed MC1061 

cells of Example 6 showed both a protein of the predicted 
size and a band of slightly higher mobility* This second 
band occurred only in induced extracts with immune, but 
not pre-immune serum. Pulse-chase experiments demon- 

35 strated that the initial five-minute labeling produced 
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only the larger of these two proteins. During the subse 
quent chase ^ period the higher mobility band appeared and 
became equal in intensity to the original band after 
twenty minutes.. This lower molecular weight, immunopre- 
5 cipitable protein may be a specific degradation, product 
of the fusion polypeptide. 

The following example is directed to vector 
constructions for expression of the thaumatin gene in 
yeast cells. 

10 

EXAMPLE 10 
A. The Construction of pING60 

15 To express the thaumatin gene in yeast, the 

PGK gene from Saccharomyces cerevisaie was isolated from 
a yeast genomic bank using colony hybridization (as 
described in Example 3) with a 17-mer synthetic probe 
complementary to the published PGK promoter sequence 

20 [Dobson, et al. . Nucleic Acid Res. , 10, 2625-2637 

(1982)]. A 3kb Hind lll fragment containing the PGK gene 
was subcloned into E.coli pBR322 to generate hybrid 
plasmid pPGK-p. pPGK-p was thereafter digested with 
Mbo II to obtain a 218 bp Mbo II fragment containing the 

25 proximal end of the PGK promoter from pPGK-p. This 

fragment was reacted with T4 DNA polymerase to produce 
blunt ends and ligated to Bell -digested and blunt-ended 
PING250 (Example 5). The resulting plasmid, pINGSl, 
contained an Mbo II site re-created at only the 5* end 

30 of the PGK promoter. Plasmid piNGSl was redigested with 
MboII, and treated with T4 DNA polymerase to produce 
blunt ends. After attachment of BamH I linkers at the 
blunt-ended Mbo II site, pING51 was digested with EcoR I 
to remove the PGK-thaumatin (portion A) sequence, which 

35 was thereafter joined to Bam HI and Eco RI digested 
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PIN6301. Tb complete tbaumatln sequence was contained 
in the resulting plasmid, plNG52. 

Plasmid pING52 was digested with BainHI, filled 
in with T4 DNA polymerase r and then digested with EcoRI 
to generate a P6K promoter-thaumatin (portion A) fragment 
which contained a blunt end at the 5* terminus and a 
EcoR I Sticky end at the 3' terminus. pPGK-p was digested 
with Xba ly treated with T4 DNA polymerase to fill in the 
endsr and digested with Eco RI , thereby removing a portion 
of the plasmid 's intact PGK structural gene. The inser- 
tion of the PGK promoter-thaumatin (portion A) fragment 
into the digested pPGK-p formed pING55a, which thereby 
contained tandem PGK promoters. A deletion was generated 
between the duplicated promoter regions through in vivo 
recombination in E.coli y placing the intact PGK promoter 
immediately upstream from the 5' end of the thaumatin 
gene. A representative clone, designated pING56a, con- 
tained only portion A of the thaumatin gene. The PGK- 
thaumatin (portion A) fragment was removed from plNG56a 
by a BamH I /EcoR I double digestion and joined to the same 
restriction sites in plNG52r resulting in plasmid pING57 
which contained the complete PGK promoter and transcrip- 
tionally active thaumatin gene on a BamH I /Xho I fragment. 

To create an E.coli -yeast shuttle vector to 
receive this BamH I /Xho I fragment, plasmid pPGK-p was 
digested with Bgl ll and EcoR I and treated with T4 DNA 
polymerase to produce blunt ends. Xhol linkers were 
attached, and the plasmid was religated and transformed 
into B.coli strain MC1061 to generate pING53, in which 
the PGK terminator was located between the Xhol and 
Hindlll sites. A BamH I /Hind lll fragment containing the 
PGK terminator and 190 base pairs of pBR322 DNA was then 
joined to the yeast -E.coli shuttle vector, pJDB209 
[Beggs, J., "Multicopy Yeast Plasmid Vectors* in Molecu- 
lar Genetics in Yeast f von Wettstein, et al. , eds. 
(Copenhagen 1981)1, at the BamHl and Hindlll sites to 
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cr ate pING58. Plasmid pJDB209 harbored in host cell 
E,coli K12 strain MC1061 has been deposited with the 
American Type Culture Collection, 12301 Parklawn Drive, 
Rockville, Maryland 20852 and designated ATCC 39449 • 
5 The BainH I /Xho I fragment from pING57 was then ligated to 
the same restriction sites in pING58, resulting in the 
PGR terminator being positioned downstream from the 
thaumatin gene. 



10 containing the thaumatin gene was inserted between 
the yeast phosphoglycerate kinase gene ( PGR ) promoter 
and terminator. Yeast strain AH-22 [Ito, et al. , 
J. Bacteriology , 153 , 163-168 (1983) 1 (ATCC 38626) was 
used as a recipient for expression of the synthetic 

15 thaumatin gene in pING60. 



cultured in SD(-)leu medium, a synthetic complete medium 
, consisting of nitrogen base without amino acids, 2% 
glucose, amino acid supplements without leucine, and 

20 purine and pyrimidine supplements, since the PGR promoter 
in this plasmid is constitutive when cells are grown in 
the presence of glucose as the sole carbon source. 
Single colony yeast transformants containing the plasmid 
PING60 were inoculated into 15 ml of SD(-) leucine broth 

25 and grown to saturation at 30®C with vigorous shaking 

^^600nm ^PP^^^i^^^^^y 2.1). The saturated cultures were 
preceded on ice and washed twice with PBS before lysis. 



intermediate plasinids were employed for the purpose of 
contributing the raf f inose-inducible yeast cycl promoter 
(PING103) and the cycl terminator (pING108) . Both 
PING103 and pINGlOS have been deposited with the American 
Type Culture Collection, 12301 Parklawn Drive, Rockvill , 



The resultant chimeric plasmid plNG60 



Yeast strains carrying the plasmid pING60 were 



B. The Construction of the Plasmid pING114 



30 



In the construction of plasmid pING114, two 
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Maryland 20852, harbored in bacterial host strain E.coli 
K12 strain HBIOI. .These deposits are identified by ATCC 
No. 39447 and No. 39448, respectively. In order to 

facilitate fusion of the cycl terminator to the thaumatin 
5 gene in the proper orientation, the Sai l site 5' to the 
cycl terminator and the Hind lll restriction site 3' to 
the terminator on pINGlOS were replaced with Xhol and 
Sai l sites by linker addition, respectively, generating 
piNGllO in which the direction of transcription proceeds 

10 from the Xho l site to the Sai l site. 

Plasmid pING301, containing the 650 base pair 
thaumatin gene bounded 5' by a BamH I site and 3' by a 
Xho l site, was BamH I digested and blunt-ended by treat- 
ment with DNA polymerase (Klenow) • Following digestion 

15 with Xho l, the 650 base pair blunt-end-XhoI thaumatin 
fragment was cloned into Xhol/PvuII-digested pINGllO, 
thereby regenerating a BamH I site at the 5' end of the 
thaumatin gene and resulting in fusion of the 3* end of 
the thaumatin gene to the cycl terminator in plasmid 

20 piNGlll, 

Plasmid pINGlll was digested with Sai l and 
blunt-ended and then digested with BamH I. The thaumatin- 
cycl terminator fusion was cloned into the high copy 
number yeast vector, pJDB209 between the BamH I site and 

25 the Hindlll site which was converted to a blunt end, 
generating pING112. 

To remove the undesirable first 7 base pairs 
of the cycl gene coding region, which occur between the 
, 5* Sai l site and the cycl promoter in plNG103, pING103 

30 was cleaved with EcoRI and a limited Bal31 exonuclease ^ 
digestion was performed. The plasmid was then digested 
with Sail, generating an array of cycl promoters that " 
have a common Sai l site at one end and a different Bal 31 
deletion end point at the other. Th cycl fragments 

35 with Bal 31 deletion sites determined by gel electropho- 
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resis ranging between 4 and 20 base pairs were isolated 
and cloned into pING112 digested with Sai l and BamH I and 
blunt-ended. 

The plasmid pING114 contains the synthetic 
5 thaumatin gene fused between the raffinose inducible 
yeast cycl gene promoter and terminator. Yeast cell 
transformation was performed as described for pIN660 and 
cells were grown under either repressed conditions with 
(YEPD) basic complete medium containing 1% yeast extract, 
10 2% bactopeptone and 2% dextrose, or derepressed 

(SD(-)leu) conditions. The latter involved addition of 
2% raffinose as the sole carbon source. Single colony 
yeast transformants carrying the plasmids were grown 
overnight in 5 ml of YEPD medium and then washed twice 
15 with sterile water. A low density inoculum (AgQQjjjjj 

approximately 0.1) was used to initiate growth in 15 ml 
cultures of YEPD or SD(-) leucine and raffinose medium. 
Cultures were harvested by cent rifugat ion when cells 
attained an AgQ^^^ density of at least 2 and were treated 
20 as described above. 

The following example is directed to prepara- 
tion and fractionation of yeast cell extracts and analy- 
sis of the polypeptide produced therein. 

25 EXAMPLE 11 

A. Preparation of Yeast Cell Extracts 

One milliliter of lysis buffer (20 mM Trxs-Cl, 
1% SDS, pH 7.2) was added to each 0.5 g of cells prepared 

30 as described above in Example 10 and the suspension 
boiled for 10 minutes to achieve cell breakage. The 
samples were then centrifuged at 14000 x g to clarify 
the supernatant and these were subsequently analyzed by 
electrophoresis through a 15% discontinuous 

35 SDS-polyacrylamide gel (SDS-PAGE) . Gels of cell extract 
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were shown to contain a protein comigrating with a 
thauinatin standard • Protein was measured by a variation 
of the method of Lowry^ et al« , J.BiQl,Chem, , 193 / 
265-275 (1951) with crystalline bovine serum albumin as 
5 a standard. 

Yeast cells (1 ml) containing the thaumatin 
gene under control of the PGK promoter (plNG60) were 
grown in SD(-)leu medium, to which was added 10 jiCi of 
[^^SJcysteine. The cells were allowed to continue 

10 growth for 1,5 hours at 30**C with shaking after which 
they were harvested and lysed by boiling in pH 7.2 Tris 
buffer containing 1% SDS, as described above. Extracts 
(200,000 cpm per sample) and E^^C] molecular weight 
markers (BRL) were fractionated by SDS-PAGE. After 

15 drying the gel, autoradiography was used to detect newly 

synthesized protein bands which showed high levels of 

[^^S] cysteine incorporation. A prominent band absent 

from cell extracts not containing the thaumatin gene 

appeared at about 22K which comigrated with a sample of 
125 

20 pure [ I] thaumatin. The same procedures performed on 
yeast clones containing the thaumatin gene under control 
of the cycl promoter also showed synthesis of a new 
protein that comigrated with a [ IJ labelled thaumatin 
standard. 

25 The newly synthesized protein present in yeast 

extracts containing the thaumatin gene was also detected 
by protein staining. Prominent bands were noted in the 
extracts of the clone containing plNG60 whose migration 
is coincident with purified thaumatin. Visualization 

30 by Coomassie blue staining and comparison of band inten- 
sity with respect to the other yeast proteins indicates 
that the thaumatin I protein comprises about 10-15% of 
the SDS-soluble protein when PGK controls expression and 
2-3% when cycl is the promoter. Yields of polypeptive 

35 product were in the range of approximately 10 mg OD/liter. 
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B. Western Blot Analysis of Yeast Extracts 

Yeast cell extracts fractionated by SDS-PAGE 
were analyzed for synthetic thaumatin by a Western Blot. 
5 A Bio Rad Trans Blot Cell was used to transfer the 
fractionated yeast proteins from a polyacrylamide gel 
to 0*45 micron nitrocellulose paper by electrophoresis 
at 50 mA for 14 hours. When electrophoresis was com- 
plete, the paper was immersed in 20 ml of 10 mM Tris-Cl, 

10 0.9% NaCl/ pH 7.4 (Buffer B) containing 5% BSA for 30 
minuteSf in 20 ml of Buffer B containing 5% BSA and 
0.2 ml thaumatin-specif ic antiserum (TH2, 5/26/83) for 
90 minutes r and then washed extensively with buffer B 
containing 0.05% Triton-x-100. A third incubation for 

15 60 minutes with Buffer B containing 5% BSA and 100 tiCi 
^^^I-protein A followed by extensive washing labeled 
only those yeast proteins that specifically reacted with 
anti-thauraatin antibodies. These were detected by auto- 
radiography. Two lanes contained 1 fig of thaumatin as 

20 a standard. Five lanes showed a new protein present in 
several yeast cell extracts that cross-reacted with 
thaumatin specific antibodies and migrated coincident 
with pure thaumatin. Extracts from cells not containing 
the thaumatin gene showed no antibody binding and thus 

125 

25 no labeling with [I] protein A. The Western Blot 
clearly shows a protein identical in relative mobility 
(molecular weight) to purified thaumatin that is recog- 
nized specifically by thaumatin antibodies. 

30 c. Radioimmunoassay 

A competitive binding radioimmunoassay (RIA) 
procedure based on published methods [Weickmann/ J.L., 
et al, , Bioch mistry , 20, 1272-1278 (1981); and Parker, 
^5 C.W., Radioimmunoassay of Biologically Active Compounds , 
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Prentice Hall, Englewood Cliffs, New Jersey, 1976)] was 

used to detect and quant itate tbaunatin in crude cell 

4 

extracts. lodinated RNase (usually 4-6 x 10 cpin), 

200 |il of 0.1 mg/ml of nonimmune rabbit seruis and any 

5 competing antigen were mixed in a total volume of 400 {il 

of Buffer A (0.05M Tris-Cl, 0.15M NaCl, and 1 mg/ml of 

egg albumin, pH 7.5). Then 10 iil of an appropriate 

dilution (usually 2 x 10^) of antiserum was added. The 

entire mixture was incubated in capped Beckman Bio-Vials 

iO at 4®C for 8-16 hours followed by the addition of 200 ^il 

of Buffer A containing sufficient goat anti-rabbit 

gamma-globulin to fully precipitate the rabbit globulins. 

A Beckman 5500 counter was used to measure the amount 
125 

of [ I] in each sample. The samples were then incu- 

15 bated at 4"C for 4 hours, followed by centrifugation at 

12000 rpm (22000x g) for 15 minutes. The supernatants 

were removed by aspiration and radioactivity measured 

in the precipitates. In the absence of inhibitors, 

125 

75-80% of the total [I] was precipitated; background 

20 precipitation by non-immune serum alone was about 4* of 
125 

the total [ U . Accurate quantitation of 8-50 ng of 

thaumatin was possible and sensitivity can easily be. 

increased by a factor of 10. The point of 50% inhibition 
125 

of binding of [ II thaumatin is 20ng of cross-reacting 

25 protein when 10 }il of a 1:2000 dilution of specific 

antiserum is used. 

When cell extracts containing newly synthesized 

thaumatin were used as competitors in the RIA, inbibi- 
125 

tion of [ II labelled purified thaumatin for antibody 



30 



35 



binding sites was observed despite losses due to SDS 
interference in the assay and denaturation of the protein 
caused by boiling. 

Numerous modifications and variations in the 
invention are expected to occur to those skilled in the 
art upon consideration of the foregoing description. 
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As one example, while the foregoing illustrative Examples 
are directed to the manufacture of a structural gene for 
microbial expression of thauraatin I# the present inven- 
tion also comprehends the manufacture of genes coding 
5 for thaumatin II, which differs from thaumatin I in only 
five amino acid residues. 

The thaumatin II coding sequence can be con- 
structed by employing the above-noted primer-directed 
mutagenesis fragment excision/religation techniques to 

10 change the five amino acid residues in the thaumatin I 
sequence. Three of the amino acid changes (Asn*^ to 
Lys*^, Ser^^ to Arg^"*, and Lys^^ to Arg^'') could be 
accomplished in a single mutagenesis procedure with two 
primers. The first primer would span the Kpn l site and 

15 change the residue number 46 codon from AAC to AAG. The 
second primer would" contain the base changes necessary 
to effect the changes at residue number 63 and at residue 
number 67. A minimum of two base changes is required 
to alter the codon specifying Ser^^ to one specifying 

20 arginine at that position (e.g., TCC to CGC) and three 
base changes are needed to secure alteration to the 
preferred codon for yeast expression, AGA. Similarly, 

while only a single base change is needed to alter the 

67 67 
codon specifying Lys to one specifying Arg (e.g., 

25 to AGG) , two base changes are needed to secure 

alteration to the preferred codon, AGA. Providing 
glutamine in place of arginine at residue number 76 would 
require two base changes, i.e., AGA to CAA, which could 
be performed in a separate mutagenesis procedure. Simi- 

30 larly, another procedure would be employed to effect the 
codon change from AAC to GAC to replace asparagine with 
aspartic acid at residue number 113. 

Alternatively, the changes in amino acid resi- 
dues 46, 63, 67 and 76 could be effected by synthesis 

3^ of four oligonucleotide fragments which are duplexed. 
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polymerized and digested to replace the Kpn l and EcoRI 
duplez of the thaumatin I sequence in pINGSOl. The 
change at residue 113 would nevertheless require the in 
vitro mutagenesis procedure as described above. 
5 Yet another example of expected modification 

of DNA sequences of the present invention involves the 
manufacture of genes for thaumatin analogs. One attempt 
to alter the sweetness intensity of the polypeptide 
includes selectively changing several of the lysine 

10 amino acid residues in the thaumatin I sequence to 

uncharged amino acids (i.e., glycine, valine). Simi- 
larly, the thaumatin I DNA sequence may be truncated at 
a variety of positions at either or both of its 5' and 
3' termini to secure microbial expression of thaumatin 

15 polypeptide fragments primarily responsible for eliciting 
the sweetness response. 

Consequently, only such limitations as appear 
in the appended claims should be placed on the invention. 
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WHAT IS CLAIMED IS: 

1. A manufactured gene capable of directing 
the synthesis in a selected host microorganism of a 
5 polypeptide having one or more of the biochemical or 
immunological properties of thaumatin* 

2» The gene according to claim 1 wherein said 
polypeptide has one or more of the biochemical or immuno- 
10 logical properties of thaumatin I. 

3. The gene according to claim 1 wherein said 
polypeptide has one or more of the biochemical or immuno- 
logical properties of thaumatin II. 

15 

4. A manufactured gene according to claim 1 
wherein the base sequence includes one or more codons, 
selected from among alternative codons specifying the " 
same amino acid^ on the basis of preferential expression 

20 characteristics of the codon in a projected host micro- 
organism. 

5. A manufactured gene according to claim 1 
wherein the base sequence includes one or more codons ^ 

25 selected from among alternative codons specifying the 

same amino acid, on the basis of preferential expression 
characteristics of the codon in E.coli , 

6. A manufactured gene according to claim 1 
30 wherein the base sequence includes one or more codons, 

selected from among alternative codons specifying the 
same amino acid, on the basis of preferential expression 
characteristics of the codon in Saccharomyces cerevisiae. 

3^ 7. A manufactured gene according to claim 1 

comprising the following sequence of nucleotide bases 
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in the coding strand thereof: 

1 10 
Ala Thr Phe Glu lie Val Asn ftxg Cys Ser Tyr Thr Val 
5"-GCT ACC TTC 6AA ATC GTT AAC AGA TGT TCT TAG ACT GTT 

5 20 

Trp Ala Ala Ala Ser Lys Gly Asp Ala Ala Leu Asp Ala Gly 
TGG GCT GCT GOT TCC AAG G6T GAC GCT GCT TTG GAG GCC GGT 

30 40 
Gly Arg Gin Leu Asn Ser Gly Glu Ser Trp Thr lie Asn Val 
GGT AGA CAA TTG AAC TCT GGT GAA TCC TGG ACC ATC AAC GTC 

50 

10 Glu Pro Gly Thr Asn Gly Gly Lys lie Trp Ala Arg Thr Asp 
GAA CCA GGT ACC AAC GGT GGT AAG ATC TGG GCT AGA ACC GAC 

60 

Cys Tyr Phe Asp Asp Ser Gly Ser Gly lie Cys Lys Thr Gly 
TGT TAC TTC GAT GAC TCT GGT TCC GGT ATC TGT AAG ACT GGT 

70 80 
15 Asp Cys Gly Gly Leu Leu Arg Cys Lys Arg Phe Gly Arg Pro 
GAC TGT GGT GGT TTG TTG AGA TGT AAG AGA TTC GGT AGA CCA 

90 

Pro Thr Thr Leu Ala Glu Phe Ser Leu Asn Gin Tyr Gly Lys 
CCA ACC ACT TTG GCT GAA TTC TCT TTG AAC CAA TAC GGT AAG 

100 110 
Asp Tyr lie Asp lie Ser Asn lie Lys Gly Phe Asn Val Pro 
20 GAC TAC ATC GAT ATC TCC AAC ATC AAG GGT TTC AAC GTT CCA 

120 

Met Asn Phe Ser Pro Thr Thr Arg Gly Cys Arg Gly Val Arg 
ATG AAC TTC TCT CCA ACC ACT AGA GGT TGT AGA GGC GTC AGA 

130 

Cys Ala Ala Asp lie Val Gly Gin Cys Pro Ala Lys Leu Lys 
25 TGT GCT GCT GAC ATC GTT GGT CAA TGT CCA GCT AAG CTT AAG 

140 150 

Ala Pro Gly Gly Gly Cys Asn Asp Ala Cys Thr Val Phe Gin 

GCT CCA GGT GGT GGT TGT AAC GAC GCT TGT ACC GTT TTC CAA 

160 

Thr Ser Glu Tyr Cys Cys Thr Thr Gly Lys Cys Gly Pro Thr 
ACT TCC GAA TAC TGT TGT ACC ACT GGT AAG TGT GGT CCA ACC 



30 



35 



170 180 
Glu Tyr Ser Arg Phe Phe Lys Arg Leu Cys Pro Asp Ala Phe 
GAA TAC TCT AGA TTC TTC AAG AGA TTG TGT CCA GAC GCT TTC 

190 

Ser Tyr Val Leu Asp Lys Pro Thr Thr Val Thr Cys Pro Gly 
TCC TAC GTC TTG GAC AAG CCA ACT ACC GTC ACT TGT CCA GGT 

200 

Ser Ser Asn Tyr Arg Val Thr Phe Cys Pro Thr Ala 
TCT TCC AAC TAC AGA GTT ACC TTC TGT CCA ACT GCC- 3' 
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8. A inanufactur d gene according to claim 1 
wherein base codons specifying a polypeptide having one 
or more of the biochemical or immunological properties 
of thaumatin include initial and/or terminal codons 

5 respectively specifying additional initial and/or 
terminal amino acids in the polypeptide synthesized. 

9. A manufactured gene according to claim 8 
wherein said initial codons specifying additional initial 

10 amino acids are codons specifying an initial methione 
residue* 

10. A manufactured gene according to claim 1 
wherein the base codons specifying a polypeptide having 

15 the biological properties, of thaumatin are preceded 
and/or followed by a sequence of bases comprising a 
portion of a base sequence which provides a recognition 
site for restriction endonuclease enzyme cleavage. 

20 11. A manufactured gene according to claim 

10 including the sequence of nucleotide bases set out 
in claim 5 preceded by the sequence 5'-GGATCCCG-3 ' and 
followed by the sequence 5 '-TAATGACTCGAG-3 S 

25 12. A fusion gene comprising a manufactured 

gene according to claim 1 fused to a second gene capable 
of directing synthesis of a second polypeptide in a 
manner permitting the synthesis of a fused polypeptide 
including the polypeptide product coded for by a manu- 
factured gene of claim 1 and said second polypeptide. 

13. A fusion gene according to claim 12 
wherein said second gene is a gene directing synthesis 
of S. typhimiirium L-ribulokinase enzyme. 



30 
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14. A biologically functional DNA microorgan- 
isin transformation vector including a manufactured gene 
according to claim 1. 

5 15 • A biologically functional DNA microorgan- 

ism transformation vector including a fusion gene 
according to claim 12. 

16. A vector according to either of claims 
10 14 or 15 which is a circular DNA plasmid. 

17. A vector according to claim 14 or 15 
further including a regulatable selected promoter/regu- 
lator DNA sequence. 



15 



20 



18. A vector according to claim 17 wherein 
said promoter/regulator DNA sequence is selected from 
among the group consisting of sequences duplicating the 
promoter/regulator sequences associated with yeast syn- 
thesis of 3-phosphoglycerate kinase or iso-l-cytochrome 
C or S.typhimurium synthesis of L-ribulokihase. 



19. A vector according to claim 17 wherein 
said promoter/regulator DNA sequence is a tac promoter/ 
25 regulator sequence. 



20. A vector according to claim 14 or 15 
further including a selected terminator DNA sequence. 

21. A vector according to claim 20 wherein 
said terminator DNA sequence is selected from among the 
group consisting of sequences duplicating the terminator 
sequences associated with yeast synthesis of 3-phospho- 
glycerate kinase or iso-l-cytochrome C. 
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22. A microoganism transformed with a vector 
according to claim 14 r 15 r 16, 17 or 20. 

23. A process for the production of a poly- 

5 peptide having one or more of the biochemical or immuno- 
logical properties of thaumatin comprising: 

growing, under appropriate nutrient conditions, 
microorganisms transformed with a biologically functional 
DNA including a manufactured gene according to claim 1/ 
10 whereby said microorganisms express said gene and produce 
said polypeptide. 

24. A process according to claim 23 wherein 
the microorganisms grown are B.coli microorganisms. 

15 

25. A process according to claim 23 wherein 
the microorganisms grown are S.cerevisiae microorganisms. 

26. A polypeptide product of the expression 
20 in a microorganism of a manufactured gene according to 

claim 1. 

27. A polypeptide product of the expression 
in a microorganism of a manufactured gene according to 

25 claim 11. 

28. An improved process for selectively 
altering the nucleotide sequence of a double-stranded 
plasmid DNA sequence having at least two unique restric- 

30 tion endonuclease enzyme recognition sites, wherein a 
single-stranded primer oligonucleotide containing the 
selected alteration is annealed to one strand of said 
double-stranded DNA sequence and extended to form a 
partial complement thereof, the improvement comprising 

35 the steps of: 
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(a) linearizing said double^stranded plasinid 
DNA sequence by restriction endonuclease enzyae digestion 
at a first unique recognition site in said sequence; 

(b) denaturing the linearized double-stranded 
5 DNA sequence formed in step (a) into two complementary 

linear single-stranded DNA sequences; 

(c) annealing said primer to one of said linear 
single-stranded sequences formed in step (b) and extend- 
ing said primer to form a partially double-stranded DNA 

10 sequence; 

(d) denaturing said partially doable-stranded 
sequence formed in step (c) into the original plasmid- 
derived single-stranded sequence and a primer-derived 
single-stranded DNA sequence; 

15 (e) linearizing said double-stranded plasmid 

DNA sequence by restriction endonuclease enzyme digestion 
at a second unique recognition site in said seq[uence; 

(f) denaturing the linear sequence formed in 
step (e) into two complementary single-stranded DNA 

20 sequences; 

(g) annealing the primer -derived single- 
stranded DNA sequence formed in step (d) to a plasmid- 
derived complementary single-stranded sequence formed 
in step Cf ) ; 

25 (h) recircularizing the annealed strands of 

step (g) into a double-stranded DNA plasmid with an 
alteration in one strand; 

(i) transforming a host microorganism with the 
plasmid formed in step (h) and isolating daughter cell 

30 populations containing plasmids with the selectively 
altered sequence by hybridization with said primer; and 

(j) transforming a host microorganism with the 
selectively altered plasmids obtained from the hybridiza- 
tion of step (i) . 

35 
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29. An improved process for selectively 
altering the nucleotide sequence of a double-stranded 
plasmid DNA sequence having first, second and third 
unique restriction endonuclease enzyme recognition sites, 
5 wherein a single-stranded primer oligonucleotide con- 
taining the selected alteration is annealed to one 
strand of said double-stranded DNA sequence and extended 
to form a partial complement thereof, and the plasmid 
DNA sequence to be altered is between said second and 
10 third recognition sites, the improvement comprising the 
steps of: 

(a) linearizing said double-stranded plasmid 
DNA sequence by restriction endonuclease enzyme digestion 
at said first recognition site in said sequence; 

15 (b) denaturing the linearized double-stranded 

DNA sequence formed in step (a) into two complementary 
linear single-stranded DNA sequences; 

(c) annealing said primer to one of said linear 
single-stranded sequences formed in step (b) and 

20 annealing to the same linear sequence a second primer 
complementary to a portion of said sequence between said 
first and second recognition sites and extending said 
primers to form a partially double-stranded DNA sequence 
including both said second and third recognition sites; 

25 (d) cleaving said partially double-stranded 

sequence formed in step (c) by restriction endonuclease 
digestion at said second and third restriction sites to 
form a double-stranded sequence which is a hybrid of a 
plasmid single strand and said primer extended strands; 

(e) inserting said fragment formed in step (d) 
into said DNA plasmid to form a double-stranded plasmid 
including said alteration in one of its strands; 

(f) transforming a host microorganism with the 
plasmid formed in step (e) and isolating daughter cell 
populations containing double-stranded plasmids with the 
selectively altered sequence by hybridization with said 
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primer; and 

(g) transforming a host microorganism with the 
selectively altered plasmids obtained from the hybridiza- 
tion of step (f ) . « 
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